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Abstract 

We propose a novel and natural architecture for decentralized control that is applicable whenever 
the underlying system has the structure of a partially ordered set (poset). This controller architecture 
is based on the concept of Mobius inversion for posets, and enjoys simple and appealing separation 
properties, since the closed-loop dynamics can be analyzed in terms of decoupled subsystems. The 
controller structure provides rich and interesting connections between concepts from order theory such 
as Mobius inversion and control-theoretic concepts such as state prediction, correction, and separability. 
In addition, using our earlier results on ^-optimal decentralized control for arbitrary posets, we prove 
that the T^-optimal controller in fact possesses the proposed structure, thereby establishing the optimality 
of the new controller architecture. 

I. Introduction 

The prevalence of large-scale complex systems in many areas of engineering has emphasized 
the need for a systematic study of decentralized control. While the problem of decentralized 
control in full generality remains a challenging task, certain classes of problems have been 
shown to be more tractable than others [11], [8], [12], [13]. 

Motivated by the intuition that acyclic structures within the context of decentralized control 
should be tractable, the authors began a systematic study of a class of systems known as poset- 
causal systems in [11]. In follow-up work [9], [14] we showed that the problem of computing 
< H 2 -optimal controllers using state-space techniques over this class of systems was tractable, with 
efficient solutions in terms of uncoupled Riccati equations. We also provided several intuitive 
explanations of the controller structure, though a detailed analysis of the same was not presented. 

In this paper we are concerned with the following questions: "What is a sensible architecture 
of controllers for poset-causal systems? What should be the role of controller states, and what 
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computations should be involved in the controller?" This paper focuses on answering this 
architectural question. Our main contributions in this paper are: 

• We propose a controller architecture that involves natural concepts from order theory and 
control theory as building blocks. 

• We show that a natural coordinate transformation of the state variables yields a novel 
separation principle. 

. We show that the optimal r H 2 controller (with state-feedback) studied in [14] has precisely 
the proposed controller structure. 

. We establish novel connections that tie together three well-known concepts: (a) Youla pa- 
rameterization in control, (b) the concept of purified output feedback in robust optimization 
and (c) Mobius inversion on posets. 
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Fig. 1. A block-diagram representation of the control architecture. The simulator predicts the unknown states at each subsystem 
using available information (the prediction at subsystem k is denoted by X k ). The controller then computes the differential 
improvement in the prediction using ji, acts on it with a local gain F(k) and then "integrates" the signal along the poset using 
£ to produce the input signal. 



The controller structure that we propose in this paper is as follows. At each subsystem of 
the overall system, the partial ordering of the information structure allows one to decompose 
the global state into "upstream" states (i.e. states that are available), "downstream" (these are 
unavailable) and "off-stream" states (corresponding to uncomparable elements of the poset). The 
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downstream and off-stream states are (partially) predicted using available upstream information 
using a "simulator" (see Fig. 1), this prediction is the role of the controller states. The best 
available information of the global state at each subsystem is then described using a matrix 
X; each column of X corresponds to the best local guess or estimate of the overall state at a 
particular subsystem. 

Having computed these local partial estimates, the controller then performs certain natural 
local operations on X that preserve the structure of the poset. These local operations are the 
well-known £ and operations in Mobius inversion. These operations, which are intimately 
related to the inclusion-exclusion formula and its generalizations, have a rich and interesting 
theory, and appear in a variety of mathematical contexts [7]. The control inputs are of the form 
U = £(F o fi(X)). As we will see later, the operators /i and £ can be interpreted as generalized 
notions of differentiation and integration on the poset so that fi(X) may be interpreted as the 
differential improvement in the prediction of the local state. Here F = {F(l), . . . , F(s)} are 
feedback gain matrices corresponding to the different subsystems. The quantity F o jx(X) may 
therefore be interpreted as a local "differential contribution" to the overall control signal. The 
overall control law then aggregates all these local contributions by "integration" along the poset 
using This architecture has been shown diagrammaticaily in Fig. 1. 

Computational and architectural issues in decentralized control have been important areas of 
study; we mention some related works below. From a computational standpoint, the problem of 
computing T^-optimal controllers for quadratically invariant systems was studied in [8], however 
that approach does not provide much insight into the structure of the optimal controller. In the 
context of decentralized control, the computational and architectural issues for the "Two-Player 
Case" were studied in [16]. This work was extended to arbitrary posets in [14] (similar results 
were obtained in [15]), and some hints regarding the structure of the optimal controller were 
provided in our previous work. Another important related work is the simpler but related team- 
theory problem over posets studied in [6] which provides us with an interesting starting point 
in this paper. We mention the work of Witsenhausen [17], [18] who provided important insight 
regarding different types of information constraints in control problems. Finally, team theory and 
decentralized control have also been studied in [4]. 

The rest of this paper is organized as follows: In Section II we introduce the necessary 
order-theoretic and control-theoretic preliminaries. In Section III we present the basic building 
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blocks involved in the controller architecture. In Section IV we describe in detail the proposed 
architecture, establish the separability principle and explain its optimality property with respect 
to the 'H2 norm. In Section V, we discuss a block diagram perspective to interpret our results. In 
Section VI, we discuss connections to the Youla parameterization and the literature on purified 
output feedback. 

II. Preliminaries 

In this section we introduce some concepts from order theory. Most of these concepts are 
well-studied and fairly standard, we refer the reader to [1], [3] for details. 



Definition 1: A partially ordered set (or poset) P = (P, <) consists of a set P along with a 
binary relation < which is reflexive, anti- symmetric and transitive [1]. 

We will sometimes use the notation a < b to denote the strict order relation a < b but a ^ b. 
An important related concept is that of a product of two posets. 

Definition 2: Let P = (P, <<p) and Q = (Q, <q) be two posets. We define their product poset 
P x (3 = (P x Q, <<pxq) to be the set P x Q equipped with the order relation <p x q satisfying 



It may be easily verified that <p x q as defined above constitutes a partial order relation. 

Most of this paper we will deal with finite posets (i.e. \P\ is finite). It is possible to represent 
a poset graphically via a Hasse diagram by representing the transitive reduction of the poset as 
a graph [1]. 

Example 1: An example of a poset with three elements (i.e., P = {1, 2, 3}) with order relations 
1 < 2 and 1 < 3 is shown in Figure 2(b). 



A. Posets 



(puqi) <p X Q (P2,qi) if Pi <p P2 and q x <q q 2 . 




Fig. 2. Hasse diagrams of some posets. 
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Let V = (P, <) be a poset and let p e P. We define lp = {qeP\p<q) (we call this the 
downstream set). Let Hp = {q e P \ p < q,q ^ p\. Similarly, let T p = {q e P \ q < p\ (called a 
upstream set), and TTp = {q & P \ q < p,q p}- We define J,Tp = {q & P \ q £ p,q £ p} (called 
the off-stream set); this is the set of uncomparable elements that have no order relation with 
respect to p. Define an interval [i,j] = {p e P \ i < p < j). A minimal element of the poset is an 
element p e P such that if q < p for some q e P then q = p. (A maximal element is defined 
analogously). 

In the poset shown in Figure 2(d), |1 = {1,2,3,4}, whereas ||1 = {2,3,4}. Similarly ffl = 
T4 = {1,2,3,4}, and TT4 = {1,2,3}. The set ||2 = {3}. 

Definition 3: Let V = (P, <) be a poset. Let Q be a ring. The set of all functions / : PxP — > 
Q with the property that f(x,y) = if y ^ x is called the incidence algebra of V over Q. It is 
denoted by I(P). * 

When the poset V is finite, the elements in the incidence algebra may be thought of as matrices 
with a specific sparsity pattern given by the order relations of the poset in the following way. 
An example of an element of I(P) for the poset from Example 1 (Fig. 2(b)) is: 



1 
1 1 
1 1 



Given two functions f,g e 1(f), their sum f+g and scalar multiplication cf are defined as usual. 
The product h = f -g is defined by h(x,y) = £ ze p f(x, z)g(z,y). Note that the above definition of 
function multiplication is made so that it is consistent with standard matrix multiplication. It is 
well-known that the incidence algebra is an associative algebra [1], [11]. 

B. Control Theoretic Preliminaries 

1) Poset-causal systems: We consider the following state-space system in continuous time: 

'Standard definitions of the incidence algebra use the opposite convention, namely f{x,y) = if x ^ y so their matrix 
representation typically has upper triangular structure. We reverse the convention so that they are lower-triangular, and thus 
in a control- theoretic setting one may interpret them as representing poset-causal maps. This reversal of convention entails 
transposing other standard objects like the zeta and the Mobius operators. For the same reason, we also reverse the convention 
of drawing Hasse diagrams so that minimal elements appear at the top of the poset. 
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x(t) = Ax(t) + w(t) + Bu(t) 

z(t) = Cx(t) + Du(t) (1) 
y(t) = x(t). 

In this paper we present the continuous time case only, however, we wish to emphasize that 
analogous results hold in discrete time in a straightforward manner. In this paper we consider what 
we will call poset-causal systems. We think of the system matrices (A, B, C, D) to be partitioned 
into blocks in the following natural way. Let V = (P, <) be a poset with P = {1, . . . , s}. We 
think of this system as being divided into s subsystems, with subsystem i having some states 
Xi(t) 6 R" 1 ', and we let N = Hiep"/ be the total degree of the system. The control inputs at 
the subsystems are Uj(t) e R mi for i e {l,...,s}. The external output is z(t) e R p . The signal 
w{t) is a disturbance signal. The states and inputs are partitioned in the natural way such that 
the subsystems correspond to elements of the poset V with x(t) = [x\(t) \x 2 (t) \. . . \x s (t) ] T , and 
u(t) = [wi(0 \u2(i) |. . . \u s (t) ] T . This naturally partitions the matrices A,B,C,D into appropriate 
blocks so that A = ^A i; j i&p , B = ^, C = [Cy] (partitioned into columns), D = j 
(We will throughout deal with matrices at this block-matrix level, so that A t j will unambiguously 
mean the block of the matrix A.) Using these block partitions, one can define the incidence 
algebra at the block matrix level in the natural way. The block sizes will be obvious from the 
context and we denote by I(P) the block incidence algebra. 

Remark In this paper, for notational simplicity we will assume n,- = 1, and m, = 1. We emphasize 
that this is only done to simplify the presentation; the results hold for arbitrary block sizes 
and m, by interpreting the formulas "block-wise" in the obvious way. 

We call such systems poset-causal due to the following causality-like property among the 
subsystems. If an input is applied to subsystem i via w, at some time t, the effect of the input 
is seen by the downstream states xj for all subsystems j e| i (at or after time t). Thus | i 
may be seen as the cone of influence of input i. We refer to this causality-like property as 
poset-causality. This notion of causality enforces (in addition to causality with respect to time), 
a causality relation between the subsystems with respect to a poset. 
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2) Information Constraints on Controller: In this paper, we will be interested in the design 
of poset-causal controllers of the form: 



A K 



B k 



(2) 



K = 



C K 



D K 



We will require that the controller also be poset-causal, i.e. that K e I(P ). In later sections we 
will present a general architecture for controllers with this structure with some elegant properties. 

A control law (2) with K e I(P) is said to be poset-causal since u t depends only on xj 
for j eT i (i.e. upstream information) thereby enforcing poset-causality constraints also on the 
controller. 



Since we are dealing with poset-causal systems (with respect to the poset P = (P, <)), most 
vectors and matrices will be naturally indexed with respect to the set P (at the block level). 
Recall that every poset V has a linear extension (i.e. a total order on P which is consistent with 
the partial order <). For convenience, we fix such a linear extension of V, and all indexing of 
our matrices throughout the paper will be consistent with this linear extension (so that elements 
of the incidence algebra are lower triangular). 

Given a matrix M, M { j will as usual denote the (/, j) th entry. The i th column will be denoted 
by M'. If M is a block |P| x \P\ matrix, we will denote M(ii, ii) to be the sub-matrix of M whose 
rows and columns are in ii. We will also need to deal with the inverse operation: we will be 
given an |5 1 x \S | matrix K (indexed by some subset S Q P) and we will wish to embed it into 
a \P\ x \P\ matrix by zero-padding the locations corresponding to row and column locations in 
P\S . We will denote this embedded matrix by K. 



The controller architecture that we propose is composed of three main ingredients: 

. The notion of local variables, 

. A notion of a local product, denoted by "o", 

. A pair of operators that operate on the local variables in a way that is consistent with 
the order-theoretic structure of the poset. These operators, called the zeta operator and the 



C. Notation 



III. Ingredients of the Architecture 
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Mobius operator respectively, are classical objects and play a central role in much of order 
theory, number theory and combinatorics [7]. 

A. Local Variables and Local Products 

We begin with the notion of global variables. 

Definition 4: A function Z : P x P —> R is called a local variable. A function z : P —> R is 
called a global variable. The local variable Z is said to be consistent with the global variable z 
if Z(i, = z(i) for all i e P. 

Remark When the set P is finite it is convenient to think of local variables Z as matrices in 
W xs and global variables z as vectors in W. The local variable Z is consistent with the global 
variable z if Z u = z,-. 

Typical global variables that we encounter will be the overall state x and the input u. Note that 

the overall system is composed of s = \P\ subsystems. Subsystem i has access to components of 

the global variable corresponding to f*', and components corresponding to XX* are unavailable. 

One can imagine each subsystem maintaining a local prediction of the global variable. This 

notion is captured by the following. The i th column of Z, denoted by Z is to be thought of 

as a local prediction of z at subsystem i. The components corresponding to XX*' correspond to 

the predictions of the unknown (downstream) components of z. Note that Z, = z, so that at 

subsystem i the component z* of the global variable is available. 

We will use the indexing Z l = [Z'.] jeP , so that Z\ denotes the local prediction of z, at subsystem 

i. We will sometimes also denote Z l . by Zj(i). While local variables in general are full matrices, 

an important class of local variables that we will encounter will have the property that they are 

in TCP). The two important local variables we will encounter are X (local state variables) and 

U (local input variables). 

Example 2: We illustrate the concepts of global variables and local variables with an exam- 
ple. Consider the poset shown in Fig. 2(d). Then we can define the global variable x and a 
corresponding local variable X as follows: 



Xl 




X\ 


X\ 


X\ 


Xl 


x 2 


X = 


x 2 {\) 


Xl 


x 2 (l) 


x 2 












x 3 




JC 3 (1) 


JC 3 (1) 


x 3 


x 3 


X\ 




x 4 (l) 


x 4 (2) 


x 4 (3) 


X\ 
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We define the following important product: 

Definition 5: Let F = {F(l), . . . ,F(s)} be a collection of maps F(i) : ii x ii —> R (viewed 
as matrices). Let X be a local variable. We define the local product F o X columnwise via 



(F o xy = for all i e P. (3) 

Remark Note that if X e TCP) and Y = F o X, then it is easy to verify that Y e I(P). We call 
the matrices F(i) the local gains. Local products give rise to decoupled local relationships in 
the following natural way. Let X, Y be local variables. If they are related via Y = F o X then the 
relationship between X and Y is said to be decoupled. This is because, by definition, 

Y k = F{k)X k for all k e P. 

Thus the maps relating the pairs (X k , Y k ) are decoupled across all k e P (i.e. Y k depends only 
on X k and not on X-> for any other j ± k). 

Example 3: Continuing with Example 2, let us define the local gains by F = {F(Y), F(2), F(3), F(4)} , 
where, 



F(l) = 



Fn(l) 


F l2 (l) 


Fn{\) 


FuiD 
















F 2 i(l) 


F 22 (l) 


^23(1) 


*24(1) 


F(2) 





F 22 (2) 





F 24 (2) 


F 3 i(l) 


F^2(l) 


*3 3 (1) 


^34(1) 
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Fai(X) 


^43(1) 


^44(1) 







F 42 (2) 
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F 44 (4) 



Then 
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Definition 6: Let M e R sXs be a matrix. Define 

n,(M) = 



Ma for i < i 

U U0 (M) = 
otherwise. 



Mij for i i j 
otherwise. 



Thus the matrix M can be decomposed as 

M = M d + M uo . 

The component M d = rLj(M) simply corresponds to the "downstream component", and is the 
projection of the matrix M onto the incidence algebra I(P ) viewed as a subspace of matrices. 
The component M uo = U uo (M) corresponds to the "upstream and offstream elements" and is the 
projection onto the orthogonal complement. 

B. The Mobius and zeta operators 

We first remind the reader of two important order-theoretic notions, namely the zeta and 
Mobius operators. These are well-known concepts in order theory that generalize discrete inte- 
gration and finite differences (i.e. discrete differentiation) to posets. 

Definition 7: Let V = (P, <). The zeta matrix £ is defined to be the matrix (:?xP->R 
such that = 1 whenever j < i and zeroes elsewhere. The Mobius matrix is its inverse, 

These matrices may be viewed as operators acting on functions on the poset / : P —> R (the 
functions being expressed as row vectors). The matrices which are members of the incidence 
algebra, act as linear transformations on / in the following way: 



Note that £(/) is also a function on the poset given by 

(C(f))t = S fj- ^ 

This may be naturally interpreted as a discrete integral of the function / over the poset. 
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The role of the Mobius operator is the opposite: it is a generalized finite difference (i.e. a 
discrete form of differentiation over the poset). If / : P —* R is a local variable then the function 
jj(f) : P —* R may be computed recursively by: 



(5) 



fi for / a minimal element, 
fi ~ Zj<i otherwise. 

Example 4: Consider the poset in Figure 2(c). The zeta and the Mobius matrices are given 



by: 



lff = 



fi h h 



1 
1 1 

1 1 1 



1 
-1 1 
-1 1 



then 



{(f) = 

M/) = 



./i f\ +fi fi+fi + h 
fi fi - fi h~ fi 



h - h (2 J 



h-hm 



h - h 




h - h 



h - h - h + Si 



Fig. 3. Two posets with their Mobius operators. The functions / are functions on the the posets, and the values of fi(f) at 
element i are indicated next the the relevant elements. 



We now define modified versions of the zeta and Mobius operators that extend the actions 
of ix and £ from global variables x to local variables X. Let £ and fx be matrices as defined in 
Definition 7. 

Definition 8: Let X be a local variable. Define the operators li : R sxv —* I(P) and £ : 
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TCP) acting via 



ax) = n d (xf) 



/x(X) = U d (Xfx T ). 



(6) 



Lemma 1: The operators f and /i may be written more explicitly as 



k<i 



(7) 



k<i 



for z < j and otherwise. 

Proof: The proofs follow in a straightforward fashion from (4) and (5). ■ 
Note that if Y = fi(X) then Y is a local variable in I(P). The operator £ has the natural 
interpretation of aggregating or integrating the local variables X k for k e P, whereas [x performs 
the inverse operation of differentiation of the local variables. 

Example 5: We illustrate the action of fj. acting on a local variable. Consider the local variable 
X from Example 2. It is easy to verify that 



/x(X) = 



xi 
x 2 (l) x 2 - x 2 (l) 
x 3 (l) 






x 3 - x 3 (l) 



X4(l) *4(2) - *4(1) *4(3) - *4(1) Xi, - X4(3) - *4(2) + X4(l) 

Lemma 2: The operators (//, satisfy the following properties: 

1) are invertible restricted to I(P) and are inverses of each other so that for all local 
variables X e I(P), 

mx)) = mx)) = x 

2) fi(X) = fi(U d (X)) and £(X) = f (^(X)). 

3) Let A,Xe I(P). Then //(AX) = A/u(X), and ^(AX) = A£(X). 
Proof: 

1) The proof is by induction. For a minimal element i, it is clear from Lemma 1 that n{^{X)) 1 . = 
£(X)'j = X l j. Now suppose i is non-minimal. As the induction hypothesis, suppose the 
statement jx{[,{X)) k . = X k . is true for all k < i. We prove the assertion is true for i. By 
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Lemma 1, we have 



MOT)} = MOT)} 

k<i 

= g(X)'j - X k j (by induction hypothesis) 

k<i 

= 4 

The proof that g(jx(X)) = X is similar. 

2) For j ^ i, fi(X)'j = /j.(H d (X))'j = 0. Using Lemma 1, for j < i, it is routine to check 
inductively that p(X)'j depends only on the values of X k . for k < i. Hence p(X) = p(H d (X)). 
The proof that £(Tl d (X)) = £(X) is similar. 

3) Note that p(X) 6 J(!P), hence A/u(X) ee, and hence (A^X))) = n{A{X))) = for j £ i. 
Now note that fi(AX) = U d (AXfj. T ), so that for j < i, 

/u(AX)) = A ik (Xiu T ) k j = A ik (ju(X))) = Afi(X) k j. 

}<k<i j^k<i 

The proof that £(AX) = A((X) is similar. 

■ 

Note that if X i I(P) then £(ji(X)) = H d (X). The second part of the preceding lemma says that 
fi(X) and £(X) depend only on the components of X that lie in i.e. on H d (X). 

Since £ and p. may be interpreted as integration and differentiation operators, the first part of 
the above lemma may be viewed as a "poset" version of the fundamental theorem of calculus. 

IV. Proposed Architecture 

A. Local States and Local Inputs 

Having defined local and global variables, we now specialize these concepts to our state- 
space system (1). We will denote xj to be the true state at subsystem j. We denote xj(i) to 
be a prediction of state xj at subsystem i. The information constraints at subsystem i can be 
decoupled into three distinct cases: 

. Information about ii: At subsystem i the state information for Hi unavailable, so a (possibly 
partial) prediction of Xj for j e Hi is formed. We denote this prediction by Xj(i). Computing 
these partial predictions is the role of the controller states. The state X[ is known at 
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subsystem i. In the following discussion we will call the downstream predictions (as well 
as the true state x,) at subystem i the "free variables in the architecture". 
. Information about Ttz: Complete state information about Xj for j e p is available, so that 
Xj{i) = Xj. Moreover, the predictions from upstream x k (j) for all k e P and j < i are also 
available. 

• Information about J,|i: At subsystem i, state information about Xj for j not comparable to 
i is unavailable. The prediction of xj is computed using xj(k) for k < i. 
Analogous information constraints hold also for the inputs. At a particular subsystem, information 
about downstream inputs is not available. Consequently, we introduce the notion of prediction 
of unknown inputs, with similar notation as that for the states. These ideas can be formalized 
by defining local variables that capture the best available information at the subsystems. We 
introduce two local variables: 

1) The local state X associated with the system state x, 

2) The local input U associated with the controller input u. 

Definition 9: The local state X is a local variable that is consistent with the global state x 
(i.e. X\ = Xi for all i e P) and satisfies the following additional properties. 

1) X d := Ii d {X) are free variables 

2) The local variable X and its component X uo := Ii U o(.X) are determined from X d via 

x uo = UMX d )f) 

(8) 

x = x d + x uo = M (x d x T . 

Note that the second equation in (8) follows from the first and the fact that X = X d + X uo . This 
can be seen by noting that 

x d + UMX d )f) = m.x d )) + u uo ( M (XdX T ) = n d ( M (x d )£ T )n U0 (M(x d )£ T ) = MW r - 

We have an analogous definition for the local input: 

Definition 10: The local input U is a local variable that is consistent with the global input 
u (i.e. U\ = U{ for all i e P) and satisfies the following additional properties. 
1) U d := H d (U) are free variables 
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2) The local variable U and its component U uo = H uo (U) are determined from U d via 

u uo = n uo (ju(UdX T ) 

(9) 

U = U d + U uo = f i(U d ){ T . 

Remark Definition 9 has been carefully made so that the local variable X enjoys some desirable 
properties. At any subsystem, the local state X' can be decomposed into the downstream and 
the upstream/offstream components. We mention the properties below: 

• The collection of downstream components of X are the free variables X d . 

• The upstream and offstream components of X are determined from the free variables X d . 
Indeed, the reader may verify that as a consequence of (8), if j < i then X 1 . = Xj = (X d ) J j 
(this is intuitive since at subsystem i, xj is known if j < i). The predictions of offstream 
states are also computed using X d . Equation (8) implies that for offstream states (i.e. / and 
j uncomparable) X) = Zk<i,k<j M*)*- 

. Our setup also ensures that n(X) 1 . = if j < i or if i and j are incomparable. (This is 
because from Lemma 2, /u(X) = ix{X d ) e I(P)). We will later see that this has a natural 
interpretation. 

Example 6: Consider the poset shown in Fig. 2(d). The matrix X shown in Example 2 is 
a local state variable. The predicted partial states are .^(IX -^(i)? M(^), *4(2), x 4 (3). The plant 
states are x l ,x 2 ,x 3 ,x 4 . These collectively are the free variables X d . The components X d and X uo 
are given by 



Xi 
















X\ 


Xi 


X\ 


* 2 (1) 


x 2 
















* 2 (1) 


x 2 














x 3 (l) 















x 3 (l) 





X3 
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The reader may verify that X uo satisfies (8). Note that since subsystems 1 and 2 have the same 
information about subsystem 3 (2 and 3 are unrelated in the poset), the best estimate of x 3 at 
subsystem 2 is ^(1). 
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B. Role of n 

We now give a natural interpretation of the operator p{X) in terms of the differential improve- 
ment in predicted states with the help of an example. 

Example 7: Consider the poset shown in Fig. 4, and let us inspect the predictions of the state 
*5 at the various subsystems. The prediction of x 5 at subsystem 1 is and the prediction of 




Fig. 4. Poset showing the differential improvement of the prediction of state x 5 at various subsystems. 

x 5 at subsystem 2 is x 5 (2). The differential improvement in the prediction of x 5 at subsystem 2 
regarding the state x 5 is x 5 (2)-x 5 (l). At subsystems 3 the formula for the differential improvement 
is similar. The differential improvement in X5 at subsystem 4 is zero (since 4 is offstream with 
respect to 5). The differential improvement for x 5 at subsystem 6 is zero (since 6 is downstream 
with respect to 5). These differential improvements are depicted in Fig. 4. Capturing these 
differential improvements is precisely the role of the Mobius operator. 

We can now understand the role of (8) better. Let X be a local state and let X t denote the i th 
row of X. The f h entry of X t (i.e. Xj) corresponds to the prediction of x, at subsystem j, so that 
the row vector X t summarizes the predictions of the state x t at the different subsystems. 

Recalling (8), we have 

X = f i(X d )£ T . 

Restricting to the i th row and rewriting we have that 

k<i 
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The natural interpretation here is that in our architecture the "atoms " for state prediction are the 
differential improvements (or increments) n(Xd). To compute the state prediction at subsystem 
i one simply "integrates" (via £) the increments corresponding to upstream subsystems p. This 
is depicted in Fig. 5. 




(X d ) 5 = [ x 5 (l) x 5 (2) i B (3) x 5 J 

n(X d ) 6 = [ z s (l) x 5 (2)-x 5 (l) x 5 (3)-x 5 (l) x 5 - x 5 (3) - x 5 {2) + x 5 {l) 0] 

Fig. 5. This figure shows how the partial prediction of the state X5 is computed at subystem 4 using (8). States that are upstream 
of 4 make predictions using free variables as reflected in the free variables (X d ) 5 . The increments or differential improvements 
fiiX^s will then form the "atoms" for prediction. To compute the prediction at subsystem 4, one simply adds increments that 
are upstream with respect to 4. This computation yields Xj = fi(X) l 5 + n(X)j + n(X)^ + fj(X)* = x 5 (3) + x 5 (2) - x s (l). 

Remark As remarked earlier, our setup ensures that the increment fJ.(Xa) l j = for j < i or if 
j and i are not comparable. We briefly explain the intuition behind this. If i is upstream of 
j, both subsystem i and j have access to the full state xj and hence there is no differential 
improvement in the state prediction. Furthermore, if i and j are unrelated, then i carries no 
additional information about xj and again it is natural to have the differential improvement be 
zero. 

Before we proceed we clarify that the entries of X correspond to partial predictions. We 
clarify the notion of a partial prediction with an example. 

Example 8: Consider the system composed of three subsystems with P = {1,2, 3} with 1 < 3 
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and 2 < 3 (see Fig. 6): 







x 2 




X 3 





A n 
A 22 

A 31 A 32 A 33 



x 2 

X3 



+ 



B n u x 
5 22 u 2 
B31 B32 B33 U3 




Fig. 6. Local state information at the different subsystems. The quantities x 3 (l) and x 3 (2) are partial state predictions. 



Note that subsystem 1 has no information about the state of subsystem 2. Moreover, the state 
x\ or input u\ does not affect the dynamics of 2 (their respective dynamics are uncoupled). 
Hence the only sensible prediction of x 2 at subsystem 1 is x 2 (l) = (the situation for u 2 {\) is 
identical). However, both the states x\, x 2 and inputs u\, u 2 affect X3 and w 3 . Since x 2 and u 2 are 
unknown, the state x 3 (l) can at best be a partial prediction of X3 (i.e. x 3 (l) is the prediction of 
the component of x 3 that is affected by subsystem 1). Similarly x 3 (2) is only a partial prediction 
of X3. Indeed, one can show that x 3 (l) + x 3 (2) is a more accurate prediction of the state jc 3 , and 
when suitably designed, their sum converges to the true state X3. 

C. Control Law 

We now formally propose the following control law: 

u d = a?°KX)). (10) 

We make the following remarks about this control law. 

Remarks 1) We note that (10) specifies which amounts to specifying the input (U^)] = u t 
for all i e P. It also specifies (Ud) l j = Uj{i) for i < j which is the prediction of the input Uj 
at an downstream subsystem i. 
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2) Since F(i) is non-zero only on rows and columns in J,i, the controller respects the in- 
formation constraints. Thus for any choice of gains F(i), the resulting controller respects 
the information constraints. In this sense (10) may be viewed as a parameterization of 
controllers. 

3) The control law (10) may be alternatively written as U' d = F(k)p(X) k . The control law 
has the following interpretation. If i is a minimal element of the poset P, then /j(X) 1 = X l d , 
the vector of partial predictions of the state at i. The local control law uses these partial 
predictions with the gain F(i). If i is a non-minimal element it aggregates all the control 
laws from f|i and adds a correction term based on the differential improvement in the 
global state-prediction p(X) 1 . This correction term is precisely F(i)p(X) 1 . 

Example 9: Consider a poset causal system where the underlying poset is shown in Fig 2(d). 
The controller architecture described above is of the form U' d = Yik<i F(k)fi(X) k (where U' is a 
vector containing the predictions of the global input at subsystem z). Noting that (U^)] = u h we 
write out the control law explicitly to obtain: 
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X\ 

















«2 


= F(1) 


x 2 (D 


+ F(2) 


x 2 -x 2 (l) 


+ F(3) 





+ F(4) 









M 3 




Xi{\) 









x 3 -x 3 (l) 







M 4 




x 4 (l) 




x 4 (2)-x 4 (l) 




x 4 (3)-x 4 (l) 




x 4 - x 4 (2) - x 4 (3) + x 4 (l) 



D. State Prediction 

Recall that at subsystem i the states xj for j e Hi are unavailable and must be predicted. 
Typically, one would predict those states via an observer. However, those states are unobservable; 
only the state for k 6 |z are observable, and are in fact directly available. In this situation, 
rather than using an observer one constructs a predictor to predict the unobservable states. These 
predictions are computed by the controller via prediction dynamics, which we now specify. 

Since the dynamics of the true state evolve according to x(t) = Ax(t) + Bu(t) + w(i), each 
subsystem can simulate these dynamics using the local states and inputs. Locally each subsystem 
implements the dynamics X'(t) = AX l {t) + BU\t). This can be compactly written as 

X(t) = AX(t) + BU(t). (11) 

We remind the reader that X = X d + X uo , and that X uo (consisting of upstream and offstream 
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components) is determined from X d via (8). Consequently, the components in X uo are not free 
variables and one needs to check that (11) constitutes a consistent set of differential equations. 
Projecting (11) onto orthogonal components using H d and U uo we obtain 

X d {t) = U d (AX(t) + BU(t)) X uo {t) = U uo (AX(t) + BU(t)). 

Before checking consistency, we simplify the differential equation for X d (the "free variables"). 
The off-diagonal terms in X d correspond to the predictions of the downstream states, so that this 
is precisely the equation that governs the prediction or simulation component prescribed in Fig. 
1. Simplifying using X = X d + X uo we get 

X d =AX d (t) + BU d (t)+R(t) 

(12) 

R(t) = U d (AX U0 (t) + BUM). 

We think of R(t) as the influence of the upstream components (and also the unrelated components) 
in predicting X d . The dynamics (12) correspond to the closed-loop dynamics. 

Remark Equation (12) along with (10) formally specifies the controller. The controller states 
correspond to the off-diagonal entries of X d (i.e. the free variables of X). The number of states 
is equal to the number of intervals in the poset. 

To check the consistency of (11) we note that from (8) we have that X uo = H uo Qj.(X d )£ T ), 
whereas on the other hand from (11) X uo = H U0 (AX + BU). It is sufficient to check that these 
two expressions are the same. To do so note the following chain of equalities: 

X uo = U U0 (M(XdX T ) 

= U U0 (M (AX d + BU d + R) (from (12)) 

= U U0 ((Aiu(X d ) + Biu(U d )X T ) (since fi(R) = 0) 

= U U0 (AX + BU) (using (8), (9)). 

E. Separation Principle 

As a consequence of Lemma 2, we see that fu(X) = /u(X d ) and also /j(R) = Ap(X uo ) + Bp(U uo ) = 
0. Applying // to (12) we obtain the following modified closed-loop dynamics in the new variables 
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p(X)(t)=Ap.(X)(t) + Bp(U)(t). (13) 

Let us define A+BF = [A + BF(1), . . . , A + BF(s)} . From (10), and the fact that yu(^(Fo//(X))) = 
F o jj(X) we will momentarily see that the modified closed-loop dynamics are: 

MW) = (A + BF) Oy u(X)(0. (14) 

These dynamics describe how the differential improvements in the state evolve. If one picks 
U such that /u(U) stabilizes p{X), the differential improvements are all stabilized. Thus p(X) 
converges to zero, the state predictions become accurate asymptotically and the closed-loop is 
also stabilized. We show that (10) achieves this with an appropriate choice of local gains. 

Theorem 1: Let F(i) be chosen such that A(ii, ii) + B(ii, |z)F(z) is stable for all i € P. Then 
the control law (10) with local gains F(i) renders (13) stable. 
Proof: Since U d = £(F o /x(X)) it follows that 

ii(u d ) = n(U) = n m ° Km = F o n(X). 

The last equality follows from Lemma 2 and the fact that F o /u(X) e As a consequence, 

jx{Uy = F{i)jj.{X) 1 for all i e P. Hence the closed-loop dynamics (13) become: 

fi(X)\t) = (A + Bp(i))fi(X)\t). 

Recalling that /i(X) is a local variable so that /u(X) 1 (viewed as a vector) is non-zero only on 
ii it is easy to see that these dynamics are stabilized exactly when F(i) are picked such that 
A(ii, ii) + B(ii, ii)F(i) are stable. 

■ 

The dynamics of the different subsystems ix{X) 1 are decoupled, so that the gains G(z) may be 
picked independent of each other. This may be viewed as a separation principle. Henceforth, we 
will assume that the gains F(i) have been picked in this manner. Since the closed loop dynamics 
of the states are related by an invertible transformation (i.e. Xj = £(ju(Xd))), if the modified 
closed-loop dynamics (14) are stable, so are the closed-loop dynamics (12). 
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F. Controller Realization 

We now describe two explicit controller realizations. The natural controller realization arises 
from the closed-loop dynamics (12) along with the control law (10) to give: 

fi(X)(t)=AX d (t) + BU d (t)+R(t) 

While the above corresponds to a natural description of the controller, it is possible to specify 
an alternate realization. This is motivated from the following observation. The control input U 
depends only on p(X). Hence, rather than implementing controller states that track the state 
predictions X, it is natural to implement controller states that compute fi(X) directly. Hence an 
equivalent realization of the controller is: 



f*(X)(t)=Aix(X)(t) + B t i(U)(t) 
U d {t) = ^o t i{X)){t). 



(15) 



G. Structure of the Optimal Controller 

Consider again the poset-causal system considered in (16). Recall that the system (1) may be 
viewed as a map from the inputs w, u to outputs z, x via 

Z = P\\W + P\ 2 u 

x = P 2 \W + P 2 2U 



where 



^11 P\i 

P21 P22 



A 


I 


B 


C 





D 


I 









(16) 



(We refer the reader to [19] as a reminder of standard LFT notation used above). In this paper 
we will assume that A e I(P) and B e I(P). Indeed, this assumption ensures that the plant 
P 22 (z) = (zI-Ar l Be TCP). 
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Consider the optimal control problem: 

minimize ||P„ + P l2 K(I - P 22 ^)- 1 P 21 || 2 

K (17) 
subject to K stabilizes P, K e 

The solution K* is the ^-optimal controller that obeys the poset-causality information con- 
straints described in Section II. The solution to this optimization problem was presented in 
[10, Theorem 3]. The main idea behind the solution procedure is as follows. Using the fact 
that PiuPn <= I CP) are square and invertible (due to the availability of state feedback) it is 
possible to reparametrize the above problem via Q = K(I - P 22 K) ~ x Pi\- Indeed, this relationship 
is invertible and the incidence algebra structure ensures that Q e I CP) if and only if K e TCP). 
Using this the above optimization problem may be rewritten as: 

minimize ||P n + P 12 Q\\ 2 

Q (18) 
subject to Q e TCP). 

Using the fact that the r H 2 norm is column- separable, it is possible to decouple this optimization 
problem into a set of s optimization problems. Each optimization problem involves the solution 
to a standard Riccati equation. The solution to each yields the columns of Q* e TCP), from 
which the optimal controller K* e TCP) may be recovered. An explicit formula for the optimal 
controller and other details may be found in [9], [14]. 

In [10], we obtain matrices K(lj,lj) by solving a system of decoupled Riccati equations via 
(Kaj,ij),Q(j),P(D) = Ric(A(lj,lj),B(lj,lj),Caj),D(lj)) (we use slightly different notation 
and reversed conventions in that paper, see [10] for details). The optimal solution K* to (17) is 
related to the proposed architecture as described below in Theorem 2. Its proof is provided in 
the appendix. 

Theorem 2: The controller (15) with gains F{i) = K(ii, ii) for all i e P is the optimal solution 
to the control problem (17). 

Theorem 2 establishes that the controller architecture proposed in this paper is also optimal in 
the sense of the < H 2 norm. 
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V. A B lock-Diagram Interpretation 

We now interpret the separation principle described in Theorem 1 from a block-diagram 
perspective. We introduce some additional notation to be used in this section. If Y is a matrix, 
we denote by vec(y) to be its standard vectorization via column concatenation. Given matrices 
M and N, M ®N denotes their Kronecker product, and we recall the standard identity: 

vec(NXM T ) = (M®N)vec(X). 

Recall the definition of U d in Definition 6 as being the projection of a matrix X onto the 
incidence algebra. We will abuse notation and also define U d to be the linear operator that acts 
on the vectorization vec(X) and zeroes components whose indices do not belong to the incidence 
algebra (the usage will be clear from the context). 

The elementary blocks that appear in our block-diagram representation are the following: 

. The plant G, which maps the inputs u to the states x, 

• The transfer functions which play the role of predicting the local state variables X from the 
states Xj and inputs uj for j e ii via (12). We call all these transfer functions collectively 
the "simulator", because their role may be interpreted as that of simulating upstream states, 

. The map jx which takes as input vec(X) and computes vec(//(X)), 
. The local gains F(l), . . . F(s), 

• The map £ which takes as input vec(//(L0) and computes vec(U d ). 

In the closed-loop system all these transfer functions are interconnected as shown in Fig. 1. 
We define the opertators £ and p. as follows: 

l:=Yl d {C®I) ju:=n d (]j®I). (19) 

Since X d = £(ju(U)), note that vec(U d ) = £vec(/i([/)) (this clarifies the role of % inFig. 1). 
Similarly, vec(/j.(X)) = p,vec(X). As one might expect, X and ju are invertible restricted to I(P). 
Lemma 3: For I and jl as defined in (19) 

Proof: This follows simply by vectorizing the first and second parts of Lemma 2. ■ 
Figure 1 explains the closed-loop at block diagram level. The role of the simulator is to form 
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predictions of downstream states from the available states and inputs. These predictions vec(X) 
are then processed by jl, to produce vec(//(X)). These are then composed with the local gains 
to produce vec(//(t/)). Finally, X acts on vec(//(i7)) to produce vec(Ud). Internally within the 
simulator one can map the downstream components U d to the local input U (using (9)) via 

= (£®I)TI d (M®I)- 

It may be easily verified using (9) that ®vec(Ud) = vec(t/). The overall composition of the plant 
G and the simulator can be combined to give a transfer function which we denote by G vec as 
shown in Fig 7. It has an explicit formula given by: 



(I®G)(^®I)U d (ju®I). 



(20) 



Thus G vec is the overall tranfer function that maps \ec(U d ) (the true and predicted downstream 

inputs) to vec(X) (the local states). We illustrate this with an example. 
Example 10: For the poset in Fig. 2(a), we have 



vec(Xj) 



x\ 

*2(1) 



x 2 



vec(U d ) 



«2(1) 



«2 



Furthermore, the map G is given by G ■ 
given by: 

Gvec 



G 2 i 





G 2 2 



Gn 

G21 G22 



G21 



and the map G V ec &s defined in (20) is 



For this poset the matrices £ and n are given by: 



7 
0/00 



/ / 





G 22 



7 
0/00 



-/ / 



It is straightforward to verify that vec(X) = G vec vec(f/). The following important identity may 
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be verified for this example: 



G n 

Ct2\ G22 



G 22 



which is a block diagonal matrix. 

As indicated in Fig. 1, the collective map from vec(£/</) to vec(X) (which collects the plant G, 
0, and the simulation block into a single transfer function) is simply given by G vec . Thus the 
block-diagram in Fig. 1 can be simplified to Fig. 7. The matrix G vec satisfies the following: 




KU) 1 



Fig. 7. A simplified block-diagram representation of the control architecture. 

Theorem 3: The matrix //G vec <f is block diagonal. 

Proof: Using (20), we have 
juG v J = ® G)(<T ® ® /)| 

= (8 /)(/ ® G)(<r (8) /)n^ ® 7)IL;(f (8 7) 

= Ii d in ® 7)(f ® 7)(7 ® G)n d (ju (8) 7)IL;(f ® 7) (using (A <g> 5)(C ® D) = AC <8> 5D) 

= FW7 (8> G)n rf 0u 8> <S> 7) (using (pi ® 7)(<T <S> 7) = 7) 

= n d (7 <g> G)II rf (using Lemma 3). 
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The last expression is clearly block-diagonal. ■ 
In terms of this block-diagram approach, the role of fi and I become very transparent: it is simply 
to diagonalize the map G vec . Once this diagonalization occurs, the controller simply applies a 
set of diagonal gains to stabilize the closed-loop. This also illustrates the separation principle at 
the block-diagram level. As mentioned in the preceding discussion, the architecture illustrated 
in block-diagram Fig 7 is also optimal, in that appropriate choice of the gains F(i) yield optimal 
controllers. 

VI. Connections to the Youla parameterization 

The Youla parameterization (and the related work on purified output feedback [2], [5]) is 
intimately related to Mobius inversion. We examine their relationship in this section. We begin 
with a brief review of the Youla parameterization. We will examine the relationship in a discrete- 
time setting as this will make the presentation much simpler. Consider the system: 

x[t] = Ax[t - 1] + Bu[t - 1] + w[t - 1] 

(21) 

y[t] = x[tl 

For simplicity, we will assume that A is stable (this can be easily achieved by choosing a 
static K which is diagonal by picking K ti such that A„ + BuKu is stable, without affecting the 
set of achievable closed-loop maps). The Youla parametrization [19, pp. 221-231] exploits the 
observation that while the set of achievable closed-loop maps is linear-fractional with respect 
to the controller variable, it is affine in terms of the parameter Q := K(I - PriK) i 1 . 

In the case of state-feedback with a stable plant, the Youla parameterization reduces to a 
particularly simple form, which we now describe. At time t, using the information of the state 
x[t - 1], the controller implements a simulation P of the plant to compute a prediction of the 
state at time t via: 

x[t] =Ax[t- 1] + Bu[t- 1]. (22) 

Note that the simulation does not have access to the disturbances w and hence simply sets it to 
zero. It then uses the output of the plant x[t] (state-feedback) and then computes the difference 
x[t] - x[t] = w[t - 1]. It then processes w[t - 1] using an arbitrary causal filter Q and sets the 
input u = Qw. This can be summarized using a block diagram as shown in Fig. 8(a). 
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(a) 



(t)s[f 
(b) 



H{x)[t\ = x[t] - x[t] 



Fig. 8. (a) A block diagram interpretation of the Youla parameterization. Here it is assumed that the plant P is stable and 
state-feedback is available. The simulator P predicts the state one time step in the future, using which the disturbance w is 
reconstructed. The filter Q then implements a disturbance feed-forward policy, (b) The time axis viewed as a poset T, At time 
t— 1 the simulator predicts the state x downstream at time t. Mobius inversion reconstructs the disturbance w[t- 1]. 



We point out that the policy u = Qw is a disturbance feed-forward policy. It has been 
observed in various papers in the literature that while state-feedback policies may yield a 
complicated dependence between the closed-loop map and the controller variable, an equiva- 
lent reparametrization using disturbance feed-forward policies yields an affine dependence. For 
example, this observation was made in the context of robust optimization for linear systems in 
the framework of "purified output-feedback" [2] and also in [4, Chapter 6]. 

The key step in reformulating a feedback problem into a disturbance feed-forward problem is 
the explicit reconstruction of the disturbance w using the output of the plant and the simulator. 
This computation may be naturally viewed as a Mobius inversion operation. To the dynamic 
evolution of the discrete-time system (21) one can naturally associate the poset T = (N, <), i.e. 
the time-axis indexed by the natural numbers (equipped with the standard ordering). This poset 
is simply the linear poset indexed by the integers (see Fig. 8(b)), and the system-theoretic notion 
of causality is simply the specialization of our notion of poset-causality specialized to this poset. 

Consider the variable x[t] (i.e. the state of the system (21)) as a function on this poset. For 
elements k such that t < k, x[t] is available and for elements k such that k < t, it is unavailable. 
Indeed at the element t- 1 a prediction of x[t] may be computed via x[t] = Ax[t- 1] + Bu[t- 1] 
(this is precisely the role of the simulator P described above). Using the Mobius inversion 
formula for the linear poset T we have p(x[t]) = x[t] - x[t] = w[t - 1]. Hence the disturbance 
computation may be viewed simply as a Mobius inversion on T. 

It is possible to extend this interpretation to poset-causal systems with multiple subsystems. 
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In this case, the system of interest is of the form (21), where A,B e TCP). The poset associated 
to the dynamic evolution of this system is then the product poset TxP. An example of product 
of this is shown in Fig. 9(a). As explained in earlier sections, subsystems maintain local states 
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Fig. 9. (a) The poset P captures causality between the subsystems and the poset T captures causality with respect to time. 
Their product poset is shown on the right, (b) The Youla parameterization implements simulation, followed by Mobius inversion 
to compute the disturbances. It then implements a disturbance feedforward policy via a causal (with respect to f X T) filter Q. 



which are summarized by the local variable X. Using information available at time t - 1, the 
prediction of X[t] may be computed as: 

X[t] =AX[t- Y\ + BU[t- 1]. (23) 

However, due to the disturbance, the value of X[t] is given by 

X[t] = AX[t - 1] + BU[t - 1] + W[t - l]£ T , (24) 

where W = diag(w). The local state variable X[t] may be viewed as a function on the poset TxP, 
and hence one may define its Mobius inverse with respect to this poset. As a consequence of 
the product structure of the underlying poset, we have the following important lemma. 

Lemma 4: Suppose X[t] satisfies (24). Let fi T xp denote the Mobius operator of the poset 
T x P. Then 

PTxp(X[t]) = W[t- 11 

Proof: We will use p{X) to denote the Mobius inversion of the local variable X with respect 
to the poset P as defined in Definition 8. It is well-known [7, Proposition 5] that the Mobius 
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operator factorizes for product posets as: 

ItrxpXlt] = VrVrX[t] = fi(X)[t] - fx(X)[t] = fx(W[t - = W[t - 1]. 

■ 

Thus the role of the Mobius operator is to compute the local disturbance w,-. Once these 
disturbances are computed, one may process it using a filter Q that is causal with respect to 
T x V to obtain the input u. This is depicted in Fig. 9(b). 

VII. Conclusions 

In this paper we considered the problem of designing decentralized poset-causal controllers 
for poset-causal systems. We studied the architectural aspects of controller design, addressing 
issues such as the role of the controller states, and how the structure of the poset should affect 
the architecture. We proposed a novel architecture in which the role of the controller states 
was to locally predict the unknown "downstream" states. Within this architecture the controller 
itself performs certain natural local operations on the known and predicted states. These natural 
operations are the well-known zeta and Mobius operations on posets. 

Having proposed an architecture, we proved two of its important structural properties. The 
first was a separation principle that enabled a decoupled choice of gains for each of the 
local subsystems. The second was establishing the optimality properties of this architecture 
with respect to the •^-optimal decentralized control problem. The proposed Mobius-based 
architecture is quite natural, has very appealing interpretations, and can be easily extended to 
more complicated. These extensions will be the subject of future work. 
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Appendix 

The required optimal solution was derived in [10] [Theorem 3] and we first develop some 
notation and concepts that have been used therein. Let us define q(i) = M^Om (so that the 
vector q(i) has only those components of /i(XV. such that i < j), and <7,(z) = fJ.(X) 1 .. Note also that 

J J 

H(X) j j = Xj - Z k <j Vjik) from (7). Let us define A d (j) = A(ij, if) + B(ij, ij)F(j). The closed-loop 
dynamics (14) at subsystem j reduce to: 



Xj~lLk<j<ij(k) 




K(j) Af 2 C/) 


Xj ~lLk<j<lj(k) 


q(j) 




K(j) Af 2 (j) 


q(j) 



(0. 



(25) 



Note that from (10) (keeping in mind that u(X) 1 . = if j £ i, and that u: = U ] .), the control law 
assumes the form: " 3 1 



k<j k<j 



Xk 



Xi<k qkd) 

q{k) 



if). 
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(Recall that F {j) {k) is the f h row of the matrix F(k)). Thus the subsystems need to compute q(j) = 

fi(X)^j, (the differential improvements in the state predictions at subsystem j) to implement the 

control law. This is an important feature of the control law: the controller states correspond to 

the differential improvements p(X) rather than the state X itself. 

It may be verified from (25) and (10) that the explicit controller for subsystem j e P assumes 
the following form: 



qU)=A? 2 (j)q(j)+A c 2 [(j) 



Uj (t) = Y J F°\k) 

k<j 



k<j 

x k ~ IjKk <lk([) 
q(k) 



(26) 



(0. 



Furthermore, note that at subsystem j, //(X) !y = for j ^ i. Hence, only the states p(X)jj for 
i e iij need to be computed. Let p(X) Uj = [M^Oy]^ •• 

Proof of Theorem 2: As mentioned, the optimal controller is given by [10, Theorem 3]. 
We will show that when we pick F(i) = K(ii, ii) in (15), we recover this controller. In [10], the 
matrices A, A$, 5^, C$, IT, and U 2 were defined. It is straightforward to verify that: 



diag(A c/ (j)) = A 
diag(A£c/)) = B* 



diag(A^0"))=A<D 



k<j 



= C®q. 



J y e p 



Letting q be the vectorization of q(j) for j e P via q = [q(j)]j e p, we may rewrite the dynamics 
in (26) as 

q = A*q(t) + B 9 (x(t) - C®q(t)). (27) 
Further, note that the vectorization of the control law equation in (26) yields 



k<j 



x k ~ Yjl<k 1k(l) 
q(k) 



= c e n 2 9 + c e ni(x 

> 


- C<s>q). 


A(X> — B^C® 




c Q (u 2 - niCo) 





q, which is pre- 



Combining the above with (27), we obtain that u = 

cisely the same expression as the controller described in [10][Theorem 3]. Since this corresponds 
to the optimal < H 2 controller, we have the required result. 
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